Automatic Categorization of Software Modules

Authors

  • Parvinder Singh Sandhu
  • Hardeep Singh
Abstract

The world of software has demonstrated the remarkable appeal of communal software development. A large number of software projects can leverage, reuse, and coordinate their work through Internet and web-based technology. For example, SourceForge currently hosts about sixty thousand software systems, and similar strategies have been suggested for corporate software development. With thousands of projects, manually locating related projects can be difficult. Hence, automatic software categorization is used to find clusters of related software projects using only the source code of the projects, and the approach is tested experimentally on a set of programs. Automatic categorization of software systems is a novel and intriguing challenge for software archives. Work on software evolution has focused on determining the intra-component relations of a given software system, which also helps to differentiate between categories. Function-oriented software produces better results than object-oriented software. Automatic categorization of software has provided better results than LSA retrieval techniques in terms of precision and recall. The multinomial Naïve Bayes scheme has outperformed all other approaches and shows better results than the existing approach (SVD) used by some open source code repositories, e.g. SourceForge. Hence, the tool can also be utilized for the automatic categorization of software components, and this kind of automation may improve.
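The abstract names a multinomial Naïve Bayes scheme and evaluation by precision and recall but gives no implementation details. The following is only a minimal sketch of that general technique, not the authors' tool: the tokenizer, the `MultinomialNB` class, the Laplace smoothing value, the toy source snippets, and the category names `network` and `database` are all assumptions made for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(source):
    """Split source text into lowercase identifier-like tokens (assumed feature set)."""
    return [t.lower() for t in re.findall(r"[A-Za-z_][A-Za-z0-9_]*", source)]


class MultinomialNB:
    """Multinomial Naive Bayes over token counts with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # smoothing constant (assumed value)

    def fit(self, docs, labels):
        self.class_docs = Counter(labels)          # documents per category
        self.token_counts = defaultdict(Counter)   # token frequencies per category
        self.vocab = set()
        for doc, label in zip(docs, labels):
            tokens = tokenize(doc)
            self.token_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def predict(self, doc):
        tokens = tokenize(doc)
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_docs.items():
            counts = self.token_counts[label]
            total = sum(counts.values())
            # log prior plus the sum of smoothed log likelihoods of each token
            score = math.log(n_docs / self.total_docs)
            for t in tokens:
                score += math.log((counts[t] + self.alpha) / (total + self.alpha * vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def precision_recall(predicted, actual, category):
    """Precision and recall for one category from parallel label lists."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p == category and a == category)
    fp = sum(1 for p, a in pairs if p == category and a != category)
    fn = sum(1 for p, a in pairs if p != category and a == category)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical toy training data: source snippets with made-up category labels.
train_docs = [
    "int sock = socket(AF_INET, SOCK_STREAM, 0); connect(sock, addr, len); send(sock, buf, n);",
    "listen(sock, backlog); accept(sock, addr, len); recv(sock, buf, n);",
    "SELECT name FROM users; cursor.execute(query); db.commit();",
    "INSERT INTO table VALUES; db.connect(); cursor.fetchall(); query",
]
train_labels = ["network", "network", "database", "database"]

model = MultinomialNB().fit(train_docs, train_labels)
prediction = model.predict("recv(sock, buf, n); close(sock);")
```

Here `prediction` holds the category whose smoothed token distribution best explains the unseen snippet, and `precision_recall` can then score a batch of such predictions against known labels for any one category.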

Similar resources

Mining Software Repositories for Defect Categorization

Early detection of software defects is very important to decrease software cost and subsequently increase software quality. The success of software industries depends not only on gaining knowledge about software defects, but also largely on the manner in which information about defects is collected and used. In software industries, individuals at different levels from customers to engi...

An Overview of Automatic Audio Segmentation

In this report we present an overview of the approaches and techniques that are used in the task of automatic audio segmentation. Audio segmentation aims to find changing points in the audio content of an audio stream. Initially, we present the basic steps in an automatic audio segmentation procedure. Afterwards, the basic categories of segmentation algorithms, and more specific the unsupervise...

ToxiCat: Hybrid Named Entity Recognition services to support curation of the Comparative Toxicogenomic Database

We report on the original implementation of named entity recognition (NER) modules based on an automatic text categorization pipeline, so-called ToxiCat (Toxicogenomic Categorizer), developed to perform biomedical documents classification and prioritization for the previous Biocreative campaign in order to speed up the curation of the Comparative Toxicogenomics Database (CTD). ToxiCat NER modul...

A K-Means Based Clustering Approach for Finding Faulty Modules in Open Source Software Systems

Prediction of fault-prone modules provides one way to support software quality engineering. Clustering is used to determine the intrinsic grouping in a set of unlabeled data. Among the various clustering techniques available in the literature, the K-Means clustering approach is the most widely used. This paper introduces a K-Means based clustering approach for finding the fault proneness of the Obj...

Automatic Categorization Tool for Open Software Repositories

The world of Open Source software has demonstrated the remarkable appeal of communal software development. A large number of software projects can leverage, reuse, and coordinate their work through Internet and web-based technology. For example, SourceForge currently hosts about sixty thousand software systems. Similar strategies have been suggested for corporate software development, through not...

Publication date: 2007